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Abstract 



The production of W bosons decaying into a tau lepton and a neutrino with the tau 
lepton decaying hadronically has been observed in LHC pp collisions at a/s = 7 TeV 
with the CMS detector. The selection criteria provide a statistically significant signal 
on the top of QCD multi-jet and electroweak backgrounds. A data-driven method for 
the estimation of the QCD multi-jet background has been employed. 
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1 Introduction 

Tau leptons serve as an important probe for many new physics processes at the LHC. Among 
others, experimental signatures that involve decays to tau leptons are crucial for searches of a 
light Higgs boson, Supersymmetry or extra dimensions. 

Tau leptons can decay either leptonically via t — )■ Ivv (l=e or mu, branching fraction is 36%) or 
into a hadronic jet and one tau-neutrino. Hadronic decay modes (Thad) produce a highly coUi- 
mated tau-jet signature, characterized by a low particle multiplicity that allows their separation 
from QCD-jets. 

In the framework of the standard model, tau leptons are mostly produced in decays of elec- 
troweak vector bosons: Z — )■ t+t^ and — > t^v. These processes have relatively large 
cross sections and are among the largest sources of tau leptons at LHC. The W — )■ t^i/ channel 
benefits from a large production cross section, exceeding the production rate of Z — > t+t^ by 
nearly an order of magnitude. However, the experimental signature of a single tau-jet and un- 
detected neutrino is challenging, requiring a good understanding of the tau identification and 
missing transverse energy (£^'®®). 

The study of W — > ti/ production in the T^^d^ final state is an important calibration sample for 
understanding tau identification and reconstruction. Also, — > t^v production has to be 
well understood as a test of the standard model and as a measure of important background 
process in several searches for new physics. In particular, it is the major background in the 
search for the charged Higgs boson in the tv final state. 

This study of W — > r^ad'^ production has been conducted using 18.4 ± 0.7 pb^^ of collision data 
from the 2010 LHC run at v^s = 7 TeV recorded with the CMS detector. See Ref. [Tl for a mea- 
surement of the cross section for Z — > t+t^ production including tau-leptons reconstructed in 
the Thad final state. 

2 Physics objects reconstruction 

The particle flow (PF) reconstruction algorithm implemented at CMS [2] is used for identi- 
fication of jets, muons, electrons, taus and E^®^. The PF technique utilizes the information 
from the whole event, aiming to provide a global event description at the level of individually 
reconstructed particles. Firstly, all tracks and energy clusters are reconstructed in each sub- 
detector. Next, all the candidates are associated in an optimal combination to one or more of 
these sub-detector signals, if they are compatible with the physics properties of each particle, 
and reconstructed in the event. The final set of particles (charged hadrons, neutral hadrons, 
photons, electrons and muons) is used to derive composite physics objects such as Thad/ jets 
and £^'^®. The PF jets are clustered using the anti-A:^ jet clustering algorithm |3| with distance 
parameter R = 0.5. 

Typically, r^ad is a highly-collimated jet comprising one or three charged mesons (predomi- 
nantly TT^) and possibly one or two neutral pions always decaying via — ?► 77. The identifi- 
cation of Thad from W boson decays requires a robust algorithm and an efficient set of selection 
criteria, as it is one of the main discriminators against large QCD jet background. 

The Thad identification algorithm used here is known as the Hadrons Plus Strips Algorithm 
(HPS) [4J. HPS starts from a high-pj charged hadron and combines it with other nearby 
charged or neutral hadrons to reconstruct t decay modes. The identification of n^s is enhanced 
by clustering the PF electrons and photons in "strips" along the bending plane to take into ac- 



2 



4 QCD Background Estimation 



count possible broadening of calorimeter signatures because of photon conversions. 

3 Event selection 

The following list of offline selection criteria is applied for the final event selection: 

• There must be at least one HPS Thad candidate with pj > 30 GeV and \rj\ < 2.3, 
and the leading track in the Thad candidate must have pj > 15 GeV. Three different 
working points (Figure [T| for the isolation has been defined [HI. The definition of 
the medium, which has been used in this analyis follows as: there must be no PF 
charged hadron or photon candidates with pj > 0.8 GeV within an isolation cone 
of size AR = 0.5, (Those candidates which are associated to the tau decay signature 
are excluded.) 

• Several cuts has been applied in order to rejects those electron and muons which fake 
taus. Furthermore we also veto the events which include good electron or muon. 
This cut supresses the W+Jet events where W decays either to muon or electron and 
jet fakes tau. 

• We require E^^^ > 35 GeV and we consider PF jets in an event with > 15 GeV 
and \rj\ < 3, and compute the ratio, Rht, of the pr of the Thad candidate to the sum 
of the Pt of the PF jets. We require Rht > 0.65. 

Further details about event selection can be found in elsewhere ||6l. 
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Figure 1: The measured fake rate as a function of efficiency evaluated using simulation for all 
working points for QCD m-enriched and W data samples. The PTDR points represent results 
of the fixed cone algorithm based on the PF taus 




4 QCD Background Estimation 

QCD events are the dominant background contribution to the final event sample. This back- 
ground cannot be reliably estimated from simulation, so a data-driven method is used. 

In the so-called "ABCD method," four regions are designated in a phase space defined by E^^^ 
and Rht- We start with an event sample obtained with no cuts on E^^^^ and Rbt, and then 
divide it into four subsamples as follows 

• region A where Rht > 0.65 and £^'®® > 35 GeV. This region is dominated by signal; 
we want to account for QCD background here. 
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• region B where Rht > 0.65 and < 35 GeV 

• region C where Kht < 0.65 and E^'^^ < 35 GeV 

• region D where Rht < 0.65 and > 35 GeV. 

In order to apply this method, we must assume that the event subsamples in regions B, C and D 
are dominated by QCD events, and there is a low statistical correlation between Rht and £^'®®. 
All other backgrounds have been neglected and no corrections have been applied due to the 
signal contribution in the B, C and D regions. 

Figure [2] illustrates the transverse mass distributions of Thad candidates and E^'^^ in regions B, 
C and D. One sees that indeed these regions are dominated by QCD background. The signal 
contribution is less than 1% in region C, and less than 5% and 10% in regions B and D, respec- 
tively. It has been shown [61 that the level of correlation between Rht arid £^'®® is sufficiently 
low to yield accurate background estimation using the ABCD method 

We estimate the yield of QCD background events in the signal region A from the numbers of 
events observed in the other regions. Specifically, we assume Na = {Nd x Ng) / Nq, and obtain 
Na = 109 ± 6 events, where the uncertainty is statistical only. 
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Figure 2: Transverse mass distributions of the Thad candidate and E^^^^ for the four designated 
regions in phase space: Region B (bottom left) where Rht > 0.65 and £^'** < 35 GeV, Region C 
(upper left) where Rht < 0.65 and E^^® < 35 GeV, and Region D (upper right) where Rht < 
0.65 and E^'^* > 35 GeV. The points represent the data. Simulated signal and electroweak 
backgrounds are represented by the filled histograms. 



5 Results 

After all selections, the expected yield of W — > tv events as well as electroweak background 
contributions are estimated using simulation while the QCD multi-jet background is estimated 
from the ABCD method described above. With the selections used in this analysis, number of 
signal event is estimated to be 174 ± 3 (stat), the number of electroweak backgrounds (domi- 
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nated by W — > ei/ ) is estimated as 46 ± 2 (stat) and the QCD multijet contribution is 109 ± 6 
(stat). The number of events observed in data is 372. 

It should be mentioned that we have not yet assessed systematic uncertainties on the back- 
ground predictions or on the signal efficiency. 

The shape of the transverse mass distribution for QCD multi-jet events is estimated using a 
data-driven method. The strategy is to relax some cuts to move into a region where QCD is 
dominant, and normalize this shape to the number of QCD events estimated with the ABCD 
method. Figure |3] shows that when changing the isolation criterion or the Rht cut (from 0.1 
to 0.5), the QCD shape does not change drastically. We decided to use a working point where 
the contribution of electroweak processes and signal events under the mass peak is reduced to 
15%, loosening the cut on Rht from 0.65 to 0.3 and using a looser isolation requirement. 

Figure|4]shows the transverse mass distribution for the final event sample, with the data-driven 
estimate of the QCD contamination. 
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Figure 3: Effect of changing the Rht and isolation criteria on the shape of Transverse Mass of 
Thad and Ef'\ 
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Figure 4: Transverse Mass of Thad and £^'®^ after all cuts 



6 Summary 

We have statistically significant signal for W — > Thad^^r with the T-lepton reconstructed in its 
hadronic decay modes, using 18.4 ± 0.7 pb^^ of data collected by the CMS Collaboration. 
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